Striatal interneurons and reward prediction errors (Commentary on Apicella et al.).
نویسنده
چکیده
Cortical-basal ganglia circuits play a vital role in the organization of thoughts and behavior. However, the specific computational processes achieved by these circuits remain unclear. Part of the problem is that the main input nucleus of the basal ganglia, the striatum, receives inputs from very wide regions of cortex and thalamus; electrophysiological studies of single striatal neurons show correspondingly diverse patterns of activity, that often defy ready classification and analysis. However, several specific subtypes of basal ganglia neurons seem to have simpler, more stereotyped patterns of firing. The best known are the midbrain cells that provide dopamine to the striatum (and elsewhere); in an enormously influential series of papers Wolfram Schultz and colleagues (e.g. Schultz, 1998) argued that these provide a unified reward prediction error signal that drives reinforcement-based learning. But another cell class that has also received much attention are the ‘tonically active neurons’ (TANs) encountered in monkey striatum. These are presumed to be cholinergic interneurons, based on the electrophysiological properties of such interneurons in rat in vitro studies. Normally firing at a moderate and steady pace, they show characteristic brief firing pauses in response to a range of salient events. Like dopamine cells, TANs are few in number but nonetheless appear to provide important control over striatal synaptic plasticity, widely considered to be a major substrate of reinforcement learning. While several groups have observed the characteristic TAN pause response, debate continues over several issues that are key to understanding the computational role of this control signal. The TAN pause response is known to be dependent on intact striatal dopamine (Aosaki et al., 1994) – so is it just passing along a signal or does it provide a distinct message to dopamine? Is this message reward-specific, or does it also occur with unexpected aversive events? And does it encode both ‘positive’ errors (an unexpected salient event) and ‘negative’ errors (omission of an expected event), to allow bidirectional control over plasticity? The paper by Apicella et al. (2009) in this issue of EJN, contributes to this ongoing debate. The authors employ a behavioral task in which cues predict forthcoming rewards with varying probability, so that the contribution of expectations to neural firing can be assessed. While this approach has been used before, Apicella et al. (2009) varied the probability of reward across blocks of trials within the same session, rather than using cue-reward probabilities that are fixed over thousands of trials. In line with one aspect of standard reinforcement learning theory, they found that the TAN pause response to reward was diminished when the reward was fully predictable. They then looked at omission of rewards, and found that the TANs split into two groups: one group that increased firing shortly after the expected reward time and another that decreased firing. In addition, even those TANs that did increase firing to reward omission did so with a variable timecourse, in contrast to the stereotypical pause response. This suggests that TANs are not serving as a unified, bidirectional signal encoding both positive and negative reward errors. These results extend a growing body of work indicating that both the cholinergic and the dopaminergic basal ganglia control signals are not as simple and unified as once thought. For example, many TANs care about the spatial location of instruction cues (Ravel et al., 2006), and TANs in the caudate part of striatum seem to care more about the onset of motivationally relevant cues than TANs in putamen, which care more about cues instructing movement onset (Yamada et al., 2004). On the dopamine side, recent papers have shown that many presumed dopamine cells fire more to aversive cues than appetitive cues (Joshua et al., 2008; Matsumoto & Hikosaka, 2009). The challenge for the field is to determine whether such variation reflects a multiplicity of functions for these neurochemical signals, or if there remains a single, underlying fundamental computation that these signals help to accomplish.
منابع مشابه
Time, Not Size, Matters for Striatal Reward Predictions to Dopamine
Midbrain dopamine neurons encode reward prediction errors. In this issue of Neuron, Takahashi et al. (2016) show that the ventral striatum provides dopamine neurons with prediction information specific to the timing, but not the quantity, of reward, suggesting a surprisingly nuanced neural implementation of reward prediction errors.
متن کاملA Cholinergic Feedback Circuit to Regulate Striatal Population Uncertainty
14 Convergent evidence suggeststhat the basal ganglia support reinforcement learning by adjusting 15 action values according to reward prediction errors. However, adaptive behavior in 16 stochasticenvironments requires the consideration of uncertainty to dynamically adjust the 17 learning rate. We consider how cholinergic tonically active interneurons (TANs) may endow the 18 striatum with such ...
متن کاملGuillem R . Esber and Geoffrey Schoenbaum Outcome Expectancy From Prediction Errors Signals Dissociating Attention and ... All
[PDF] [Full Text] [Abstract] , January 26, 2011; 31 (4): 1507-1515. J. Neurosci. Paul Apicella, Sabrina Ravel, Marc Deffains and Eric Legallet during Instrumental Task Performance The Role of Striatal Tonically Active Neurons in Reward Prediction Error Signaling [PDF] [Full Text] [Abstract] , March 16, 2011; 31 (11): 4178-4187. J. Neurosci. Benjamin Y. Hayden, Sarah R. Heilbronner, John M. ...
متن کاملThe role of striatal tonically active neurons in reward prediction error signaling during instrumental task performance.
The detection of differences between predictions and actual outcomes is important for associative learning and for selecting actions according to their potential future reward. There are reports that tonically active neurons (TANs) in the primate striatum may carry information about errors in the prediction of rewards. However, this property seems to be expressed in classical conditioning tasks...
متن کاملRAPID COMMUNICATION Influence of Predictive Information on Responses of Tonically Active Neurons in the Monkey Striatum
Apicella, Paul, Sabrina Ravel, Pierangelo Sardo, and Eric Leend, we compared the activity of tonic striatal neurons during gallet. Influence of predictive information on responses of tonically performance of a simple reaction time task in the presence active neurons in the monkey striatum. J. Neurophysiol. 80: 3341– and absence of an instruction cue presented at a fixed interval 3344, 1998. We ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- The European journal of neuroscience
دوره 30 3 شماره
صفحات -
تاریخ انتشار 2009